Improving Video Retrieval Using Multilingual Knowledge Transfer

نویسندگان

چکیده

Video retrieval has seen tremendous progress with the development of vision-language models. However, further improving these models require additional labelled data which is a huge manual effort. In this paper, we propose framework MKTVR, that utilizes knowledge transfer from multilingual model to boost performance video retrieval. We first use state-of-the-art machine translation construct pseudo ground-truth video-text pairs. then learn representation where English and non-English text queries are represented in common embedding space based on pretrained evaluate our proposed approach four datasets such as MSRVTT, MSVD, DiDeMo Charades. Experimental results demonstrate achieves all outperforming previous Finally, also video-retrieval dataset encompassing six languages show outperforms zero-shot setting.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Deliverable Speech-to-Text Systems with Multilingual Knowledge Transfer

This paper reports our recent progress on using multilingual data for improving speech-to-text (STT) systems that can be easily delivered. We continued the work BBN conducted on the use of multilingual data for improving Babel evaluation systems, but focused on training time-delay neural network (TDNN) based chain models. As done for the Babel evaluations, we used multilingual data in two ways:...

متن کامل

Knowledge Transfer: Revisiting Video

Knowledge transfer has been an important issue for organizational knowledge management programs. This article reviews the plethora of user-generated video activity and the issues it creates for knowledge management activities. Video’s media richness combined with its ability to convey rich narratives can facilitate sensemaking and learning. However, structure and culture are important factors t...

متن کامل

Using Knowledge Representation Languages for Video Annotation and Retrieval

Effective usage of multimedia digital libraries has to deal with the problem of building efficient content annotation and retrieval tools. In particular in video domain, different techniques for manual and automatic annotation and retrieval have been proposed. Despite the existence of well-defined and extensive standards for video content description, such as MPEG-7, these languages are not exp...

متن کامل

Improving Question Retrieval in Community Question Answering Using World Knowledge

Community question answering (cQA), which provides a platform for people with diverse background to share information and knowledge, has become an increasingly popular research topic. In this paper, we focus on the task of question retrieval. The key problem of question retrieval is to measure the similarity between the queried questions and the historical questions which have been solved by ot...

متن کامل

Improving Retrieval Performance using World-Knowledge Generated Features

Information Retrieval is the task of retrieving information items (documents, images, videos etc.) most relevant to a given user query. The common approach in textual IR systems is to index and retrieve documents by selecting representative key words and phrases within them, using various statistical, linguistic and semantic methods, and viewing each document as a vector in the vector space def...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2023

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-28244-7_42